Laughter detection using ALISP-based N-Gram models
نویسندگان
چکیده
Laughter is a very complex behavior that communicates a wide range of messages with different meanings. It is highly dependent on social and interpersonal attributes. Most of the previous works (e.g. [1, 2]) on automatic laughter detection from audio uses frame-level acoustic features as parameters to train their machine learning techniques, such as Gaussian Mixture Models (GMMs), Support Vector Machines (SVMs) etc. However, segmental approaches that capture higher-level events have not been adequately focussed due to the nonlinguistic nature of laughter. This paper is an attempt to detect laughter regions with the help of automatically acquired acoustic segments using Automatic Language Independent Speech Processing (ALISP) [3, 4] models.
منابع مشابه
Exploiting High-Level Information Provided by ALISP in Speaker Recognition
The best performing systems in the area of automatic speaker recognition have focused on using short-term, low-level acoustic information, such as sepstral features. Recently, various works have demonstrated that high-level features convey more speaker information and can be added to the low-level features in order to increase the robustness of the system. This paper describes a text-independen...
متن کاملFusion for Audio-Visual Laughter Detection
Laughter is a highly variable signal, and can express a spectrum of emotions. This makes the automatic detection of laughter a challenging but interesting task. We perform automatic laughter detection using audio-visual data from the AMI Meeting Corpus. Audio-visual laughter detection is performed by combining (fusing) the results of a separate audio and video classifier on the decision level. ...
متن کاملLinear and non-linear fusion of ALISP-based and GMM systems for text-independent speaker verification
Current state-of-the-art speaker verification algorithms use Gaussian Mixture Models (GMM) to estimate the probability density function of the acoustic feature vectors. They are denoted here as global systems. In order to give better performance, they have to be combined with other classifiers, using different fusion methods. The performance of the final classifier depend on the choice of the s...
متن کاملDetection of laughter in children's speech using spectral and prosodic acoustic features
Laughter is an important para-linguistic cue that can be useful in gauging the affective state of the speaker. In this paper, we present an approach to detecting laughter in children’s speech using acoustic features in the spectral and prosodic domains. Feature selection was performed using the information gain-based technique and a speaker-independent validation using a support vector machine ...
متن کاملAnalysis of Engagement and User Experience with a Laughter Responsive Social Robot
We explore the effect of laughter perception and response in terms of engagement in human-robot interaction. We designed two distinct experiments in which the robot has two modes: laughter responsive and laughter non-responsive. In responsive mode, the robot detects laughter using a multimodal real-time laughter detection module and invokes laughter as a backchannel to users accordingly. In non...
متن کامل